Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.

نویسندگان

  • Charlotte Rehm
  • Lena A Wurmthaler
  • Yuanhao Li
  • Tancred Frickey
  • Jörg S Hartig
چکیده

In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1-5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6-9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

In silico screening of G-Quadruplex Structures in Wilms tumor 1 Gene Promoter

Introduction: X-ray diffraction studies have revealed that guanines in a DNA stands may be arranged in quartet and form a structure called G-quadruplexs. Bioinformatics studies suggested the formation of G-quadruplex structure in human crucial genes, including Wilms tumor 1 (WT1). The aim of this study was to in silico analysis of the guanine-rich sequence in the promoter region of the WT1 gene...

متن کامل

G-quadruplexes are specifically recognized and distinguished by selected designed ankyrin repeat proteins

We introduce designed ankyrin repeat binding proteins (DARPins) as a novel class of highly specific and structure-selective DNA-binding proteins, which can be functionally expressed within all cells. Human telomere quadruplex was used as target to select specific binders with ribosome display. The selected DARPins discriminate the human telomere quadruplex against the telomeric duplex and other...

متن کامل

Atomic Force Microscopy and Voltammetric Investigation of Quadruplex Formation between a Triazole-Acridine Conjugate and Guanine-Containing Repeat DNA Sequences.

The interactions of the Tetrahymena telomeric repeat sequence d(TG4T) and the polyguanylic acid (poly(G)) sequence with the quadruplex-targeting triazole-linked acridine ligand GL15 were investigated using atomic force microscopy (AFM) at a highly oriented pyrolytic graphite and voltammetry at a glassy carbon electrode. GL15 interacted with both sequences, in a time dependent manner, and G-quad...

متن کامل

G-Quadruplexes Involving Both Strands of Genomic DNA Are Highly Abundant and Colocalize with Functional Sites in the Human Genome

The G-quadruplex is a non-canonical DNA structure biologically significant in DNA replication, transcription and telomere stability. To date, only G4s with all guanines originating from the same strand of DNA have been considered in the context of the human nuclear genome. Here, I discuss interstrand topological configurations of G-quadruplex DNA, consisting of guanines from both strands of gen...

متن کامل

Identification and toxigenic potential of a cyanobacterial strain (Stigomena sp.)

Cyanobacteria are well known for their production of a multitude of highly toxic substances . The genus Stigomena is regarded as good candidates for producing biologically active secondary metabolites, which are highly toxic to humans and other animals. The carcass of a dog was found at the shore of Lake Ali-Abad, Iran. Biomass from the discovery site appeared to be of cyanobacterial nature. We...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • PloS one

دوره 10 12  شماره 

صفحات  -

تاریخ انتشار 2015